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We present here a new and homogeneous sample of 3340 galaxies selected 
from the Sloan Digital Sky Survey (SDSS) based solely on the observed strength 
of their H5 Hydrogen Balmer absorption line. The presence of a strong H5 line 
within the spectrum of a galaxy indicates that the galaxy has under-gone a 
significant change in its star formation history within the last Gigayear. There- 
fore, such galaxies have received considerable attention in recent years, as they 
provide an opportunity to study galaxy evolution in action. These galaxies are 
commonly known as "post-starburst" , "E+A", "k+a" and H<5-strong galaxies, 
and the study of these galaxies has been severely hampered by the lack of a large, 
statistical sample of such galaxies. In this paper, we rectify this problem by se- 
lecting a sample of galaxies which possess an absorption H<5 equivalent width of 
EW(H5 max ) - AEW(H<5 max ) > 4Afrom 106682 galaxies in the SDSS. We have 
performed extensive tests on our catalog including comparing different method- 
ologies of measuring the H5 absorption lines and studying the effects of stellar 
absorption, dust extinction and emission-filling on our measurements. We have 
determined the external error on our US measurements using duplicate observa- 
tions of 11538 galaxies in the SDSS. The measured abundance of our H<5-selected 
(HDS) galaxies is 2.6 ± 0.1% of all galaxies within a volume-limited sample of 
0.05 < z < 0.1 and M(r*) < —20.5, which is consistent with previous studies of 
such galaxies in the literature. We find that only 25 of our HDS galaxies in this 
volume-limited sample (3.5 ±0.7%) show no, or little, evidence for [On] and Ha 
emission lines, thus indicating that true E+A (or k+a) galaxies (as originally 
defined by Dressier & Gunn) are extremely rare objects at low redshift, i.e., only 
0.09 + 0.02% of all galaxies in this volume-limited sample are true E+A galaxies. 
In contrast, 89 ± 5% of our HDS galaxies in the volume-limited sample have 
significant detections of the [On] and Ha emission lines. Of these, only 131 
galaxies are robustly classified as Active Galactic Nuclei (AGNs) and therefore, 
a majority of these emission line HDS galaxies are star-forming galaxies. We 
find that 52 ± 12% (27/52) of galaxies in our volume-limited HDS sample that 
possess no detectable [Oil] emission, do however possess detectable Ha emission 
lines. These galaxies may be dusty star-forming galaxies. We provide the com- 
munity with this new catalog of H5-selected galaxies to aid in the understanding 
of these galaxies, via detailed follow-up observations, as well as providing a low 
redshift sample for comparison with higher redshift studies of HDS galaxies. We 
will study the global properties of these galaxies in future papers. 
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1. Introduction 

The presence of a strong H5 absorption line (equivalent width of ^ 5A) in the spectrum 
of a galaxy is an indication that the spectral energy distribution of that galaxy is dominated 
by A stars. Models of galaxy evolution indicate that such a strong B.5 line (in the spectrum of 
a galaxy) can only be reproduced using models that include a recent burst of star formation, 
followed by passive evolution, as any on-going star-formation in the galaxy would hide the 
B5 absorption line due to emission-filling (of the B5 line) and the dominance of hot O and B 
stars, which have intrinsically weaker B5 absorption than A stars (see, for example, Balogh et 
al. 1999; Poggianti et al. 1999). Therefore the existence of a strong B5 absorption line in the 
spectrum of a galaxy suggests that the galaxy has under-gone a recent transformation in its 
star-formation history. In the literature, such galaxies are called "post-starburst" , "E+A", 
k+a, and H5-strong galaxies and the exact physical mechanism(s) responsible for the abrupt 
change in the star formation history of such galaxies remains unclear. These galaxies have 
received much attention as they provide an opportunity to study galaxy evolution "in action". 

H5-strong galaxies were first discovered by Dressier & Gunn (1983, 1992) in their spec- 
troscopic study of galaxies in distant, rich clusters of galaxies. They discovered cluster 
galaxies that contained strong Balmer absorption lines but with no detectable [On] emis- 
sion lines. They named such galaxies "E+A" , as their spectra resembled the superposition of 
an elliptical galaxy spectrum and A star spectrum. Therefore, E+A (or k+a) galaxies were 
originally thought to be a cluster-specific phenomenon and several physical mechanisms have 
been proposed to explain such galaxies. For example, ram-pressure stripping of the intra- 
stellar gas by a hot, intra-cluster medium, which eventually leads to the termination of star 
formation once all the gas in the galaxy has been removed, or used up (Gunn & Gott 1972; 
Farouki & Shapiro 1980; Kent 1981; Abadi, Moore & Bower 1999; Fujita & Nagashima 1999; 
Quilis, Moore & Bower 2000). Alternative mechanisms include high-speed galaxy-galaxy 
interactions in clusters (Moore et al. 1996, 1999) and interactions with the gravitational 
potential well of the cluster (Byrd & Valtonen 1990; Valluri 1993; Bekki, Shioya & Couch 
2001). 

To test such hypotheses, Zabludoff et al. (1996) performed a search for E+A (or k+a) 
galaxies in the Las Campanas Redshift Survey (LCRS; Shectman et al. 1996) and found that 
only 21 of the 11113 LCRS galaxies they studied, satisfied their criteria for a E+A galaxy. 
This work clearly demonstrates the rarity of such galaxies at low redshift. Furthermore, 
Zabludoff et al. (1996) found that 75% of their selected galaxies reside in the field, rather than 
the cores of rich clusters. This conclusion was confirmed by Balogh et al. (1999), who also 
performed a search for EW-strong galaxies in the redshift surveys of the Canadian Network 
for Observational Cosmology (CNOC; Yee, Ellingson, & Carlberg 1996), and found that the 
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fraction of such galaxies in clusters was consistent with that in the field. Alternatively, the 
study of Dressier et al. (1999) found an order-of-magnitude increase in the abundance of 
E+A (or k+a) galaxies in distant clusters compared to the field (see also Castander et al. 
2001). Taken together, these studies suggest that the physical interpretation of H5-strong 
galaxies is more complicated than originally envisaged, with the possibility that different 
physical mechanisms are important in different environment, e.g., 5 of the 21 E+A galaxies 
discovered by Zabludoff et al. (1996) show signs of tidal features, indicative of galaxy-galaxy 
interactions or mergers. Furthermore, redshift evolution might be an important factor in the 
differences seen between these surveys. 

In addition to studying the environment of FK-strong galaxies, several authors have 
focused on understanding the morphology and dust content of these galaxies. This has been 
driven by the fact that on-going star formation in post-starburst galaxies could be hidden 
by dust obscuration (See Poggianti & Barbaro 1997; Poggianti et al. 1999; Bekki et al. 2001 
for more discussion). In fact, Smail et al (1999) discovered examples of such galaxies using 
infrared (IR) and radio observations of galaxies in distant clusters. They discovered five 
post-starburst galaxies (based on their optical spectra) that showed evidence for dust-lanes 
in their IR morphology, as well as radio emission consistent with on-going star formation. 
However, radio observations of the Zabludoff et al. (1996) sample of nearby E+A galaxies 
indicates that a majority of these galaxies are not dust-enshrouded starburst galaxies. For 
example, Miller & Owen (2001) only detected radio emission from 2 of the 15 E+A galaxies 
they observed, and the derived star-formation rates (SFRs) were consistent with quiescent 
star formation and thus much lower than those observed for the dust-enshrouded starburst 
galaxies of Smail et al (1999). Chang et al. (2001) also did not detect radio emission from 
5 E+A galaxies they observed from the Zabludoff et al. (1996) sample and concluded that 
these galaxies were not dust-enshrouded starbursts. In summary, these studies demonstrate 
that some E+A galaxies have dust-enshrouded star formation, but the fraction remains ill- 
determined. Furthermore, it is unclear how these different sub-classes of galaxies are related, 
and if there are any environmental and evolutionary processes at play. 

The interpretation of H5-strong galaxies (E+A, k+a, etc.) suffers from small number 
statistics and systematic differences in the selection and definition of such galaxies between 
the different surveys constructed to date. Therefore, many of the difficulties associated with 
understanding the physical nature of these galaxies could be solved through the study of a 
large, homogeneous sample of H<5 galaxies. In this paper, we present such a sample derived 
from the Sloan Digital Sky Survey (SDSS; York et al. 2000). The advantage of this sample, 
over previous work, is the quality and quantity of both the photometric and spectroscopic 
data, as well as the homogeneous selection of SDSS galaxies which covers a wide range of 
local environments. 
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We present in this paper a sample of galaxies that have been selected based solely on 
the observed strength of their H<5 absorption line. Our selection is thus inclusive, containing 
many of the sub-classes of galaxies discussed in the literature until now, e.g., "E+A" or 
"k+a" galaxies (Zabludoff et al. 1996; Dressier et al. 1999), post-starburst galaxies, dust- 
enshrouded starburst galaxies (Smail et al. 1999), H5-strong galaxies (Couch & Sharpies 
1987) and the different subsamples of galaxies (i.e., e(a), A+em) discussed by Poggianti et 
al. (1999) and Balogh et al. (1999). Therefore, to avoid confusion with other samples in the 
literature, we simply call this sample of SDSS galaxies; "H<5-selected" (HDS) galaxies. 

In this paper, we present the details of our selection and leave the investigation and 
interpretation of these HDS galaxies to subsequent papers. We publish our sample of HDS 
galaxies to help the community construct larger samples of such galaxies, which are critically 
needed to advance our understanding of these galaxies, as well as aiding in the planning of 
follow-up observations and comparisons with higher redshift studies of such galaxies. 

In Section 2, we present a brief discussion of the SDSS and the data used in this paper. 
In Section 3, we discuss our techniques for measuring the H5 absorption line and present 
comparisons between the different methodologies used to measure this line. In Section 4, we 
discuss the criteria used to select of our HDS sample of galaxies and present data on 3340 
such galaxies in our catalog. In Section 5, we compare our sample of galaxies with those 
in the literature. A more detailed analysis of the properties of our HDS galaxies will be 
discussed in subsequent papers. The cosmological parameters used throughout this paper 
are H =75 km s _1 Mpc -1 , Q m = 0.3 and Q A = 0.7. 



2. The SDSS Data 

In this Section, we briefly describe the spectroscopic part of the SDSS. As discussed 
in York et al. (2000), the SDSS plans to obtain spectra for ~ 10 6 galaxies to a magnitude 
limit of r* = 17.7 (the "Main" galaxy sample; Strauss et al. 2002), ~ 10 5 Luminous Red 
Galaxies (LRG; Eisenstein et al. 2001) and ~ 10 5 quasars (Richards et al. 2002). The reader 
is referred to Fukugita et al. (1996), Gunn et al. (1998), Lupton et al. (1999, 2001), York 
et al. (2000), Hogg et al. (2001), Pier et al. (2002), Stoughton et al. (2002), Smith et al. 
(2002) and Blanton et al. (2002a) for more details of the SDSS data and survey. 

The SDSS spectra are obtained using two fiber-fed spectrographs (each with 320 fibers), 
with each fiber sub-tending 3 arcseconds on the sky. The wavelength coverage of the spec- 
trographs is 3800A to 9200A, with a spectral resolution of 1800. The data from these 
spectrographs is automatically reduced to produce flux and wavelength-calibrated spectra 
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(SPECTR02D data analysis pipeline). 

The SDSS spectra are then analyzed via the SDSS SPECTR01D data processing 
pipeline to obtain a host of measured quantities for each spectrum (see Stoughton et al. 
2002; Frieman et al., in prep, for further details). For example, SPECTROID determines 
the redshift of the spectrum both from absorption lines (via cross-correlation; Heavens 1993), 
and emission lines (via a wavelet-based peak-finding algorithm; see Frieman et al., in prep). 
Once the redshift is known, SPECTROID estimates the continuum emission at each pixel 
using the median value seen in a sliding box of 100 pixels centered on that pixel. Emission 
and absorption lines are then measured automatically by fitting of a Gaussian, above the 
best-fit continuum, at the redshifted rest-wavelength of expected lines. Multiple Gaussians 
are fit simultaneously for potential blends of lines (i.e., Ha and [Nil] lines). SPECTROID 
therefore provides an estimate of the equivalent width (EW), continuum, rest wavelength, 
identification, goodness-of-fit (x 2 ), height and sigma (and the associated statistical errors 
on these quantities) for all the major emission/absorption lines in these spectra. These mea- 
surements are done regardless of whether the line has been detected or not. For this work, 
we have used data from rerun 15 of the SPECTROID analysis pipeline, which is based on 
version 4.9 of the SPECTR02D analysis pipeline (see Frieman et al. in prep for details of 
these pipelines). 

For the sample of HDS galaxies presented in this paper, we begin with a sample of the 
SDSS galaxies that satisfy the following selection criteria: 

1. Spectroscopically-confirmed by SPECTROID to be a galaxy; 

2. Possess a redshift confidence of > 0.7; 

3. An average spectroscopic signal-to-noise of > 5 per pixel in the SDSS photometric g 
passband; 

4. z>0.05, to minimize aperture effects as discussed in Zaritsky, Zabuldoff & Willick 
(1995) and Gomez et al. (2003). 

The reader is referred to Stoughton et al. (2002) for further details on all these SDSS 
quantities and how they are determined. After removing duplicate observations of the same 
galaxy (11538 galaxies in total; see Section 3.4), 106682 galaxies satisfy these criteria, up 
to and including spectroscopic plate 804 (observed on a Modified Julian Date of 52266 or 
12/23/01; see Stoughton et al. 2002). Of these 106682 galaxies, it was only possible to 
measure the H<5 line for 95479 galaxies (see Section 3.1 below) due to masked pixels at or 
near the H£ line. In Figure 1, we present the distribution of signal-to-noise ratios for all 
106682 spectra (the median value of this distribution is 8.3). 



-7- 



Throughout this analysis, we have used the "smeared" SDSS spectra, which improves 
the overall spectrophotometric calibration of these data by accounting for light missed from 
the 3" fibers due to seeing and atmospheric refraction (see Gomez et al. 2003; Stoughton et 
al. 2002 for observational detail). Unfortunately, this smearing correction can systematically 
bias the observed flux of any emission and absorption lines in the spectrum, as the correction 
is only applied to the continuum. As shown by Hopkins et al. in prep., this is only a ~ 10% 
effect on the flux of spectral lines, compared to using spectral data without the smearing 
correction applied. Furthermore, the equivalent width of our lines is almost unaffected by 
this smearing correction as, by definition, they are computed relative to the height of the 
continuum. 



3. Spectral Line Measurements 

3.1. H5 Equivalent Width 

In this Section, we discuss the measurement of the equivalent width (EW) of the B.S 
absorption line in the SDSS galaxy spectra described in Section 2. The presence of a strong 
US absorption line in a galaxy spectrum indicates that the stellar population of the galaxy 
contains a significant fraction of A stars, which must have formed within the last Gigayear 
(see Section 1). The H<5 line is preferred over other Hydrogen Balmer lines (e.g., He, H(, 
H7, H/3) because the line is isolated from other emission and absorption lines, as well as 
strong continuum features in the galaxy spectrum (e.g., D4000). Furthermore, the higher 
order Balmer lines (H7 and H/5) can suffer from significant emission-filling (see Section 3.3), 
while the lower order lines (He and H£) are low signal-to-noise in the SDSS spectra. 

In previous studies, several different methods have been employed to measure the B.S 
line, or select post-starburst galaxies. For example, Zabludoff et al. (1996) used the av- 
erage EW of the H/9 , H7 and H£ lines to select E+A galaxies. Alternatively, Dressier et 
al. (1999) and Poggianti et al. (1999) interactively fit Gaussian profiles to the B.5 line. 
Finally, Abraham et al. (1996), Balogh et al. (1999) and Miller & Owen (2002) performed 
a non-parametric analysis of their galaxy spectra, which involved summing the flux in a 
narrow wavelength window centered on each of the B.5 line to determine the EW of the line. 
Castander et al. (2001) used an innovative PCA and wavelet analysis of spectra to select 
E+A galaxies. Each of these methods have different advantages and dis-advantages. For 
example, fitting a Gaussian to the B5 line is optimal for high signal-to-noise spectra, but 
can be prone to erroneous results when fit blindly to low signal-to-noise data or a weak 
absorption lines (such problems can be avoided if Gaussians are fit interactively; see Dressier 
et al. 1999; Poggianti et al. 1999). In light of the potential systematic differences between 
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the different methods of measuring the H<5 line, we have investigated the relative merits of 
the two main approaches in the literature - fitting a Gaussian and summing the flux in a 
narrow wavelength window - for determining the EW of the H<5 line for the signal-to-noise, 
resolution, and size of the SDSS spectral dataset used in this paper. 

First, we investigate the optimal method for computing the EW of the B5 line from 
the SDSS spectra using the non-parametric methodology outlined in Abraham et al. (1996) 
and Balogh et al. (1999), i.e., summing the flux within narrow wavelength windows centered 
on and off the H<5 absorption line. We estimate the continuum flux via linear interpolation 
between two wavelength windows placed either side of the H<5 line (4030A to 4082A and 
4122A to 4170A). We used the same wavelength windows as in Abraham et al. (1996) 
and Balogh et al. (1999) for estimating the continuum because they are devoid of any 
strong emission and absorption features, and the continuum is relatively smooth within these 
wavelength ranges. Also, these windows are close to the B.S line without being contaminated 
by the B5 line emission. When fitting the continuum flux level, the flux in each pixel was 
weighted by the inverse square of the error on the flux in that pixel. After the initial fit to 
the continuum, we re-iterate the fit once by rejecting 3a outliers to the original continuum 
fit. This guards against noise spikes in the surrounding continuum. 

The rest-frame EW of the B5 line was calculated by summing the ratio of the flux in 
each pixel of the spectrum, over the estimated continuum flux in that pixel based on our 
linear interpolation. For this summation, we investigated two different wavelength windows 
for the B5 line; 4088A to 4116A, which is the same as the wavelength range used by Balogh 
et al. (1999) 13 and 4082A to 4122A, which is the wider range of Abraham et al. (1996). We 
summarize the wavelength ranges used to measure the B5 EWs in Table 1. 

In Figure 2, we compare the two non-parametric measurements of B5 (i.e., using the 
narrow and wide wavelength windows), and find, as expected, a strong linear relationship 
between the two measurements: The scatter about the best fit linear relationship to these 
measurements is Gaussian with a = 0.29A. However, there are systematic differences between 
the two measurements which are correlated to the intrinsic width of the B5 line. For example, 
for large EWs of B5, we find that the wider wavelength window has a larger value than the 
narrower window. This is because the 28A window is too small to capture the wings of 
a strong B5 line and thus a wider window is needed. Alternatively, for smaller EWs, the 
narrower 28A window is better as the larger window of 40A is more affected by noise as it 
contains more of the continuum flux than the narrow window. This is seen in Figure 2 where 



13 We note that Table 1 of Balogh et al. (1999) has a typographical error. The authors used the wavelength 
range of 4088A to 4116A to measure their H<5 EWs instead of 4082A to 4122A as quoted in the paper. 
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the larger window systematically over-estimates the EW of the weaker H<5 lines. 

As a compromise, we have empirically determined that the best methodology for our 
analysis is to always select the larger of the two B5 EW measurements (this was discovered 
by visually inspecting many of the spectra and their various H<5 measurements). This is a 
crude adaptive approach of selecting the size of the window based on the intrinsic strength 
of the H<5 line. In fact, we find that 20.2% of our H<5-selected galaxies (see Section 4) 
were selected based on the B5 measurement in the larger wavelength window. Therefore, 
for the analysis presented in this paper, we use H<5 max , which is the maximum of the two 
non-parametric measurements discussed above. 

In Figure 3, we now compare the H5 max measurement discussed above to the automatic 
Gaussian fits to the B5 line from the SDSS SPECTROID analysis of the spectra. As 
expected, the two methods give similar results for the EW of the B.5 line for the largest 
EWs. However, there are significant differences, as seen in Figure 3, between these two 
methodologies. First, there are many galaxies with a negative EW (emission) as measured 
by SPECTROID, but possess a (large) positive EW (absorption) using the non-parametric 
method. These cases are caused by emission-filling, i.e., a small amount of B5 emission at 
the bottom of the B5 absorption line (see Section 3.3). This results in SPECTROID fitting 
the Gaussian to the central emission line, thus producing a negative EW. On the other 
hand, the non-parametric method simply sums all the flux in the region averaging over the 
emission and still producing a positive EW. In Figure 4, we present five typical examples of 
this phenomenon. 

Another noticeable difference in Figure 3 is the deviation from the one-to-one relation 
for B5 EWs near zero, i.e., as the B5 line becomes weak, it is buried in the noise of the 
continuum making it difficult to automatically fit a Gaussian to the line. In such cases, 
SPECTROID tends to over-estimate the EW of the B5 line because it preferentially fits a 
broad, shallow Gaussian to the noise in the spectrum. Typical examples of this problem are 
shown in Figure 5. We conclude from our study of the SDSS spectra that the non-parametric 
techniques of Abraham et al. (1996) and Balogh et al. (1999) are preferred to the automatic 
Gaussian fits of SPECTROID, especially for the lower signal-to-noise SDSS spectra which 
are a majority in our sample (see Figure 1). We note that many of the problems associated 
with the automatic Gaussian fitting of SPECTROID can be avoided by fitting Gaussians 
interactively. However, this is not practical for large datasets such as the SDSS. 
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3.2. [On] and Ha Equivalent Widths 

In addition to estimating the EW of the B.5 line, we have used our flux-summing 
technique to estimate the rest frame equivalent widths of both the [On] and Ha emission 
lines. We perform this analysis on all SDSS spectra, in preference to using the SPECTR01D 
measurements of these lines, as these emission lines are the primary diagnostics of on-going 
star-formation in a galaxy and thus, we are interested in detecting any evidence of these 
lines in our HDS galaxies. As discussed in Section 3.1, the flux-summing technique is better 
for the lower signal-to-noise spectra, while the Gaussian-fitting method of SPECTROID is 
optimal for higher signal-to-noise detections of these emission lines, especially in the case 
of Ha where SPECTROID de-blends the Ha and [Nil] lines. 

We use the same flux-summing methodology as discussed above for the H£ line, except 
we only use one wavelength window centered on the two emission lines. We list in Table 1 
the wavelength intervals used in summing the flux for the [Oil] and Ha emission lines and 
the continuum regions around these lines. Once again, the continuum flux per pixel for each 
emission line was estimated using linear interpolation of the continuum estimated either side 
of the emission lines (weighted by the inverse square of the errors on the pixel values during 
a line fitting procedure). We again iterate the continuum fit once rejecting 3a outliers to the 
original continuum fit. We do not de-blend the Ha and [Nil] lines and as a result, some 
of our Ha EW measurements may be over-estimated. However the contamination is less 
than 5% from [Nil] line at 6648A and less than 30% from [Nil] line at 6583A. We present 
estimates of the external error on our measurements of [On] and Ha in Section 3.4. 

In Figure 7, we compare our [On] equivalent width measurements to that from SPEC- 
TROID for all SDSS spectra. In this paper, positive EWs are absorption lines and neg- 
ative EWs are emission lines. There is a good agreement between the two methods for 
EW([0n])> 10A, where the scatter is < 10%. However, at lower EWs, the SPECTROID 
measurement of [Oil] is systematically larger than our flux-summing method which is the 
result of SPECTROID fitting a broad Gaussian to the noise in the spectrum. Furthermore, 
our continuum measurements for the [On] line are estimated close to the [On] line, in a 
region of the spectrum where the continuum is varying rapidly with wavelength (e.g., D4000 
break). Once again, we are only concerned with making a robust detection of any [On] emis- 
sion, rather than trying to accurately quantify the properties of the emission line. Therefore, 
we prefer our non-parametric method, especially for the low signal-to-noise cases. 

In Figure 8, we compare our Ha equivalent width measurements against that of SPEC- 
TROID for all SDSS spectra regardless of their H5 EW. The two locii of points seen in 
this figure are caused by contamination in our estimates of Ha due to strong emission lines 
in AGNs, i.e., the top locus of points have larger EWs in our flux-summing method than 



- 11 - 



measured by SPECTR01D due to contamination by the [Nil] lines. This is confirmed by 
the fact that the top locus of points is dominated by AGNs. At low Ha EWs, we again see 
a systematic difference between our measurements and those of SPECTROID, with SPEC- 
TROID again over-estimating the Ha line because it is jointly fitting multiple Gaussians to 
low signal-to-noise detections of the Ha and [Nil] emission lines. Finally, we do not make 
any correction for extinction and stellar absorption on our flux-summed measurements of 
Ha(see Section 3.3). 

3.3. Emission— Filling of the H5 Line 

As mentioned above, our measurements of the H<5 absorption line can be affected by 
emission-filling, i.e., B5 emission at the bottom of the B5 absorption line. This problem 
could be solved by fitting two Gaussians to the B5 line; one for absorption, one for emission. 
We found however that this method is only reliable for spectra with a signal-to-noise of > 20 
and, as shown in Figure 1, this is only viable for a small fraction of our spectra. Therefore, 
we must explore an alternative approach for correcting for this potential systematic bias; 
however, we stress that the sense of any systematic bias on our non-parametric summing 
method would be to always decrease (less absorption) the observed EW of the B5 absorption 
line and thus our technique gives a lower limit to the amount of B5 absorption in the 
spectrum. 

To help rectify the problem of emission-filling, we have used the Ha and Bf3 emission 
lines (where available) to jointly constrain the amount of emission-filling at the B5 line 
as well as estimate the effects of internal dust extinction in the galaxy. Furthermore, our 
estimates of the emission-filling are complicated by the effects of stellar absorption on the 
Ha and Bf3 emission lines. In this analysis, we have used the SPECTROID measurements 
of Ha and Bf3 lines in preference to our flux-summing technique discussed in Section 3.2, 
because the emission-filling correction is only important in strongly star-forming galaxies 
where the Ha and Bf3 emission lines are well fit by a Gaussian and, for the Ha line, require 
careful de-blending from the [Nil] lines. 

To solve the problem of emission-filling, we have adopted two different methodologies 
which we describe in detail below. The first method is an iterative procedure that begins 
with a initial estimate for the amount of stellar absorption at the Bj3 and Ha emission lines, 
i.e., we assume EW (absorption) = 1.5A and Ha EW (absorption) = 1.9A (see Poggianti 
& Barbaro 1997; Miller & Owen 2002). Then, using the observed ratio of the Ha and B(3 
emission lines (corrected for stellar absorption), in conjunction with an attenuation law of 
r = A A~ a7 (Chariot & Fall 2000) for galactic extinction and a theoretical Ha to H/3 ratio 
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of 2.87 (case B recombination; Osterbrock 1989), we solve for A in the attenuation law and 
thus gain extinction-corrected values for both the Bf3 and Ha emission lines. Next, using 
the theoretical ratio of Bf3 emission to B5 emission, we obtain an estimate for the amount 
of emission-filling (extinction-corrected) in the B5 absorption line. We then correct the 
observed B5 absorption EW for this emission-filling and, assuming the EW(H<5) absorption 
is equal to EW(Bj3) absorption and EW(Ha) absorption is equal to 1.3 + 0.4x EW(H/3) 
absorption (Keel 1983), we obtain new estimates for the stellar absorption at the Ha and 
H/3 emission lines, i.e., where we begun the iteration. We iterate this calculation five times, 
but on average, a stable solution converges after only one iteration. 

Our second method uses the D4000 break to estimate the amount of stellar absorption 
at H/3, using EW(H/3) = -5.5 x D4000 + 11.6 (Poggianti & Barbara 1997; Miller & Owen 
2002). Then, assuming EW(Ha) absorption is equal to 1.3 + 0.4 x EW(H/3) absorption, 
we obtain an measurement for the amount of stellar absorption at both the Ha and H/3 
absorption lines. As in the first method above, we use the Chariot & Fall (2000) attenuation 
law, and the theoretical Ha to Bj3 ratio, to solve for the amount of extinction at Ha and Bf3, 
and then use these extinction-corrected emission lines to estimate the amount of emission- 
filling at B5. We do not iterate this method, as we have used the measured D4000 break to 
independently estimate the amount of stellar absorption at Bf3 and Ha. 

We have applied these two methods to all our SDSS spectra, except for any galaxy that 
possesses a robust detection of an Active Galactic Nucleus (AGN) based on the line indices 
discussed in Kewley et al. (2002) and Gomez et al. (2003). For these AGN classifications, we 
have used the SPECTROID emission line measurements. We also stop our emission-filling 
correction if the ratio of the H/3 and Ha line becomes unphysical, i.e., greater than 2.87. By 
definition, the emission-filling correction increases our observed values of the B5 absorption 
line, with a median correction of 15% in the flux of the B.5 absorption line. In Figure 6, 
we show the distributions of B5 emission EWs calculated using the two methods described 
above in a solid (iteration) and a dashed (D4000) line, respectively. It is re-assuring that 
overall these two methods give the same answer and have similar distributions. 

3.4. External Errors on our Measured Equivalent Widths 

Before we select our HDS sample of galaxies, it is important to accurately quantify the 
errors on our EW measurements. In our data, there are 11538 galaxies spectroscopically 
observed twice (see Section 2), which we use to quantify the external error on our EW 
measurements. In Figures 9, 10 & 11, we present the absolute difference in equivalent width 
of the two independent observations of the B5 , Ha and [On] lines, as a function of signal- 
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to-noise. In this analysis, we have used the lower of the two measured signal-to-noise ratios 
(in SDSS g band for H<5 and Bf3, or r band for Ha) as any observed difference in the two 
measurements of the EW will be dominated by the error in the noisier (lower signal-to-noise) 
of the two spectra. From this data, we determine the la error for each line and assign this, 
as a function of signal-to-noise, to our EW measurements for each galaxy. We determine the 
sigma of the distribution by fitting a Gaussian (as a function of signal-to-noise) as shown 
in Figures 12, 13 & 14, and then use a 3rd order polynomial to interpolate between the 
signal-to-noise bins, thus obtaining the solid lines shown in Figures 9, 10 & 11. Using these 
polynomial fits, we can estimate the la error on our EWs for any signal-to-noise. The 
coefficients of the fitted 3rd order polynomial for each line are given in Table 2. 

In addition to quantifying the error on [On], Ha and B5, we have used the duplicate 
observations of SDSS galaxies to determine the error on our emission-filling corrections. 
Only 400 (564) of the 11538 duplicate observations of SDSS galaxies have strong Ha and 
Bf3 emission lines which are required for the iterative (D4000) method of correcting for 
emission-filling. The errors on the emission-filling correction are only a weak function of 
signal-to-noise, so we have chosen to use a constant value for their error, rather than varying 
the error as a function of the galaxy signal-to-noise as done for B.5, [Oil] and Ha emission 
lines. One sigma errors on the emission correction of B5 for the iterative method (EF1) and 
the D4000 method (EF2) are 0.57A and 0.4A in EW, respectively. 



4. A Catalog of HDS Galaxies 

We are now ready to select our sample of HDS galaxies using the non-parametric mea- 
surements of the B5 EW (i.e., EW(H5 max )). We begin by imposing the following threshold 
on EW(H5 max ); 

EW(H5 max ) - AEW(H<W) > 4A, (1) 

where AEW(H5 max ) is the la error on B5 based on the signal-to-noise of the spectrum (see 
Figure 9). We have chosen this threshold (4A) based on visual inspections of the data and 
our desire to select galaxies similar to those selected by other authors (Zabludoff et al. 1996; 
Balogh et al. 1999; Poggianti et al. 1999), i.e., galaxies with strong recent star formation 
as defined by the B5 line. This threshold (Eqn. 1) is applied without any emission-filling 
correction. For the signal-to-noise ratios of our spectra (Figure 1), only galaxies with an 
observed B5 of <; 5 A satisfy Eqn. 1, which is close to the 5A threshold used by Balogh et al. 
(1999) to separate normal star-formating galaxies from post-starburst galaxies (see Figures 
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8 & 9 in their paper). Therefore, our HDS sample should be similar to those already in the 
literature, but is still conservative enough to be inclusive of the many different subsamples 
of H<5-strong galaxies, like k+a, a+k, A+em and e(a), as discussed in Pogginati et al. (1999) 
and Balogh et al. (1999). We will present a detailed comparison of our HDS sample with 
models of galaxy evolution in future papers. 

We call the sample of galaxies that satisfy Eqn. 1 "Sample 1" , and the sample contains 
2526 galaxies from the main SDSS galaxy sample and 234 galaxies targeted for spectroscopy 
for other reasons, e.g., mostly because they were LRG galaxies (see Eisenstein et al. 2002), 
or some were targeted as stars or quasars (see Gordon et al. 2002). 

We now apply the emission-filling correction for each galaxy to EW(H5 max ) and select 
an additional sample of galaxies, that were not already selected in Sample 1 via Eqn 1, but 
now satisfy both the following criteria; 

EW(fW max ) - AEW(H5 max ) - AEW(EFl) > 4A, (2) 

EW(H5 max ) - AEW(HcW) - AEW(EF2) > 4A, (2) 

where AEW(EFl) = 0.57A and AEW(EF2) = 0.4A are the la errors on the iterative method 
(EF1) and D4000 method (EF2) of correcting for emission-filling discussed in Section 3.4. 
Therefore, this additional sample of galaxies represents systems that would only satisfy the 
threshold in Eqn 1 because of the emission-filling correction (in addition to galaxies already 
selected as part of Sample 1). We call this sample of galaxies "Sample 2" and it contains 
483 galaxies from the main SDSS galaxy sample and 97 galaxies which were again targeted 
for spectroscopy for other reasons, e.g., LRG galaxies, stars or quasars. On average, Sample 
2 galaxies have strong emission lines because, by definition, they have the largest emission- 
filling correction at the H<5 line. We have imposed both of these criteria to control the 
number of extra galaxies scattered into the sample. If we relaxed these criteria (i.e., remove 
both AEW(EFl) and AEW(EF2)), then the sample would increase from 483 galaxies (in 
the main galaxy sample) to 1029. For completeness, we provide the extra 546 galaxies, which 
would be included if these criteria were relaxed, on our webpage. 

In total, 3340 SDSS galaxies satisfy these criteria (Sample 1 plus Sample 2), and we 
present these galaxies as our catalog of HDS galaxies. We note that only 131 of these 
galaxies are securely identified as AGNs using the prescription of Kewley et al. (2002) and 
Gomez et al. (2003). In Figure 15, we present the fraction of HDS galaxies selected as a 
function of their signal-to-noise in SDSS g band. It is re-assuring that there is no observed 
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correlation, which indicates that our selection technique is not biased by the signal-to-noise 
of the original spectra. 

For each galaxy in Samples 1 and 2, we present the unique SDSS Name (col. 1), 
heliocentric redshift (col. 2), spectroscopic signal-to-noise in the SDSS photometric g band 
(col. 3), Right Ascension (J2000; col. 4) and Declination (J2000; col. 5) in degrees, Right 
Ascension (J2000; col. 6) and Declination (J2000; col. 7) in hours, minutes and seconds, 
the rest-frame EW(H5) (A, col. 8), the rest-frame AEW(H<5) (A, col. 9), the rest-frame 
EW([On]) (A, col. 10), the rest-frame AEWQOll]) (A, col. 11), the rest-frame EW(Ha) 
(A, col. 12), the rest-frame AEW(Ha) (A, col. 13), the SDSS Petrosian g band magnitude 
(col. 14), the SDSS Petrosian r band magnitude (col. 15), the SDSS Petrosian i band 
magnitude (col. 16), the SDSS Petrosian z band magnitude (col. 17; all magnitudes are 
extinction corrected), the k-corrected absolute magnitude in the SDSS r band (col. 18), 
SDSS measured seeing in r band (col. 19), concentration index (col. 20, see Shimasaku 
et al. 2001 and Strateva et al. 2001 for definition). In Column 21, we present the AGN 
classification based on the line indices of Kewley et al. (2002), and in Column 22, we present 
our E+A classification flag, which is defined in Section 5.1. An electronic version of our 
catalog can be obtained at http://astrophysics.phys.cmu.edu/~tomo/ea. 14 

In addition to presenting Samples 1 and 2, we also present a volume-limited sample 
selected from these two samples but within the redshift range of 0.05 < z < 0.1 and with 
M(r*) < —20.5 (which corresponds to r = 17.7 at z — 0.1, see Gomez et al. 2003). We 
use Schlegel, Finkbeiner & Davis (1998) to correct for galactic extinction and Blanton et al. 
(2002b; vl_ll) to calculate the k-corrections. In Table 3, we present the percentage of HDS 
galaxies that satisfy our criteria. In this table, the number of galaxies in the whole sample 
(shown in the denominator) changes based on the number of galaxies that could have had 
their B.5, [On] and Ha lines measured because of masked pixels in the spectra. 

We note here that we have not corrected our sample for possible aperture effects, except 
restrict the sample to z > 0.05: A 3 arcsecond fiber corresponds to 2.7hj^ kpc at this 
redshift, which is comparable to the half-light radius of most our galaxies (see also Gomez 
et al. 2003). We see an increase of 0.33A ( < 10%) in the median observed R5 EW for the 
whole HDS sample over the redshift range of our volume-limited sample (0.05 < z < 0.1). 
This is probably caused by more light from the disks of galaxies going down the fiber at 
higher redshifts. We see no such trend for the subsample of true E+A galaxies (see Section 
5) in our HDS sample. 



14 Mirror sites are available at http://kokki.phys.cmu.edu/~tomo/ea, http://sdss2.icrr.u- 
tokyo .ac.jp/ ~yohnis /ea 
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5. Discussion 

In this Section, we compare our sample of HDS galaxies against previous samples of such 
galaxies in the literature. Further analysis of the global properties (luminosity, environment, 
morphology, etc.) of these galaxies will be presented in future papers. 

5.1. Comparison with Previous Work 

In this Section, we present a preliminary comparison of our SDSS HDS galaxies with 
other samples of post-starburst galaxies in the literature. We have attempted to replicate 
the selection criteria of these previous studies as closely as possible, but in some cases, this 
is impossible. Furthermore, we have not attempted to account for systematic differences in 
the distribution of signal-to-noises, differences in the spectral resolutions, and differences 
in the selection techniques used by different authors, e.g., we are unable to fully replicate 
the criteria of Zabludoff et al. (1996) as we do not yet possess accurate measurements of 
the H7 and H/5 lines. Therefore, these are only crude comparisons and any small differences 
seen between the samples should not be over-interpreted until a more detailed analysis can 
be carried out. We summarize our comparison in Table 4 and discuss the details of these 
comparisons below: Table 4 does however, demonstrate once again the rarity of FW-strong 
galaxies, especially at low redshift, as well as illustrating the sensitivity of their detection to 
the selection criteria used. 

We first compare our sample to that of Zabludoff et al. (1996), which is the most similar 
to our work, especially as the magnitude limits of the LCRS (used by Zabludoff et al. 1996) 
and the SDSS are close, thus minimizing possible bias. Zabludoff et al. (1996) selected 
E+A galaxies from the LCRS using the following criteria; a redshift range of 15, 000 < 
cz < 40, 000 km s -1 , a signal-to-noise of > 8 per pixel, an EW of [On] of > — 2.5A, and an 
average EW for the three Balmer lines (H5, H7 and H/3) of > 5.5A. Using these four criteria, 
Zabludoff et al. (1996) selected 21 LCRS galaxies as E+A galaxies, which corresponds to 
0.2% of all LCRS that satisfy the same signal-to-noise and redshift limits. We find 80 SDSS 
galaxies (from the whole SDSS dataset analyzed here) satisfy the same redshift range and 
[On] EW detection limit as used by Zabludoff et al. (1996), as well as having B.5 EW of 
>5.5A, which should be close to the average of the EW of the three Balmer lines (H<5, H7 
and H/9) used by Zabludoff et al. (1996). Of these 80 galaxies, 71 are in our HDS sample. 
Given all the caveats discussed above, it is re-assuring that we have found HDS galaxies at 
a similar frequency (see Table 4) as Zabludoff et al. (1996) and it suggests that our criteria 
are consistent. 



-17- 



Several other authors have used similar criteria to Zabludoff et al. (1996) to search for 
post-starburst galaxies in higher redshift samples of galaxies. For example, Fisher et al. 
(1998) used an average EW of the RS, H 7 and R{3 lines of > 4A and an EW of [On] of 
< 5A. With these criteria, they found 4.7% of their galaxies were E+A galaxies. Similarly, 
Hammer et al. (1997) used an average of H<5, H 7 and Rf3 of > 5.5A, an EW of [On] of 
<5-10 A and Mb <-20 to select E+A galaxies from the CFRS field galaxy sample. They 
found that 5% of their galaxies satisfied these criteria. 

We have also attempted to replicate the selection criteria of Dressier et al. (1999) and 
Balogh et al. (1999) as closely as possible, using all SDSS galaxies in the volume-limited 
sample with measured H<5, [On] and Ha, and not simply the HDS subsample. For example, 
the MORPHS collaboration of Dressier et al. (1999) and Poggianti et al. (1999) selected 
E+A galaxies using a H<5 EW of > 3 A and an [Oil] EW of < 5A . Using these criteria, they 
found a significant excess of E+A galaxies in their 10 high redshift clusters (21%), compared 
with the field region (6%). Alternatively, Balogh et al. (1999) selected E+A galaxies using 
E5 EW of > 5A and an [On] EW of < 5A . They found instead 1.5 ± 0.8% of cluster 
galaxies were classified as E+A galaxies compared to 1.2 ± 0.8% for the field (brighter than 
M(r) = —18.8 + 51og/i after correcting for several systematic effects). 

In Table 4, we present a qualitative comparison of our HDS sample with these higher 
redshift studies and, within the quoted errors, the frequencies of HDS galaxies we observe 
are consistent with these works. However, we caution the reader not to over-interpret these 
numbers for several reasons. First, we are comparing a low redshift sample (z < 0.1) of HDS 
galaxies to high redshift studies (z ~ 0.5) of such galaxies, and we have not accounted for 
possible evolutionary effects or observational biases. In particular, we are comparing our 
sample against the corrected numbers presented by Balogh et al. (1999), which attempt 
to account for scatter in the tail of the B.S distribution due to the large intrinsic errors on 
R5 measurements, while Dressier et al. (1999) do not make such a correction. Secondly, 
we are comparing a field sample of HDS galaxies to predominantly cluster-selected samples 
of galaxies (were available, we only quote in Table 4 frequencies based on field samples). 
Finally, the luminosity limit of our volume-limited sample is brighter than the high redshift 
studies, which may account for some of the discrepancies. 

Finally, we note that the original E+A phenomenon in galaxies, as discussed by Dressier 
& Gunn (1983, 1992), was defined to be a galaxy that possessed strong Balmer absorption 
lines, but with no emission lines, i.e., a galaxy with the signature of recent star-formation 
activity (A stars), but no indication of on-going star-formation (e.g., nebular emission lines). 
Given the quality of the SDSS spectra, we can re-visit this specific definition and select 
galaxies from our sample that possess less than la detections of both the Ha and [On] 
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emission lines, i.e., EW([OII]) + AEWQOII]) > OA and EW(Ha) + AEW(Ha) > OA). We 
find that only 3.5 ±0.7% (25/717) of galaxies in the volume limited HDS sample satisfy such 
a strict criteria (see Table 3). We show example spectra of these galaxies in Figure 16 and 
highlight them in the catalog using the E+A classification flag. This exercise demonstrates 
that true E+A galaxies - with no, or little, evidence for on-going star-formation - are 
extremely rare at low redshift in the field, i.e., 0.09 ± 0.02% of all SDSS galaxies in our 
volume-limited sample. 

5.2. HDS Galaxies with Emission lines 

In this Section, we examine the frequency of nebular emission lines ([On], Ha) in the 
spectra of our HDS galaxies. This is possible because of the large spectral coverage of 
the SDSS spectrographs which allow us to study both the [Oil] and Ha emission lines 
for all galaxies out to a redshift of ~ 0.35. We begin by looking at HDS galaxies that 
possess both the Ha and [On] emission lines. Using the criteria that both [On] and 
Ha lines must be detected at > la significance (i.e., EW([OII]) + AEW([OII]) < OA and 
EW(Ha) + AEW(Ha) < OA), we find that 89±5% (643/717) of HDS galaxies in our volume- 
limited sample are selected. Of these, 131 HDS galaxies possess a robust detection of an AGN, 
based on the line indices of Kewley et al. (2001), similar to the AGN plus post-starburst 
galaxy found recently in the 2dFGRS (see Sadler, Jackson & Cannon 2002). Therefore, a 
majority of these emission line H<5 galaxies may have on-going star formation and are similar 
to the e(a) and A+em subsample of galaxies discussed by Poggianti et al. (1999) and Balogh 
et al. (1999). We show in Figure 17 examples of these HDS galaxies that possess both the 
[On] and Ha emission lines. The median SFR of these galaxies (calculated from Ha flux, 
see Kennicut 1998) is ~ O.5M /yr, with a maximum observed SFR of 5OM /yr. We note 
that these SFRs have not been corrected for dust extinction or aperture effects, which could 
result in them being a factor of 5 to 10 lower than the true SFR for the whole galaxy (see 
Hopkins et al. in prep). 

We next examine the frequency of HDS galaxies with [On] emission, but no detectable 
Ha emission. Using the criteria of EW(Ha) <1 a detection and EW([On]) > \o detection 
(i.e., EW([OII]) + AEWQOII]) < OA and EW(Ha) + AEW(Ha) > OA), we find 2.9±0.65% 
(21/717) of our HDS galaxies in the volume-limited sample satisfy these cuts. The presence 
of [On] demonstrates that the galaxy may possess on-going star formation activity, yet the 
lack of the Ha emission is curious. Possible explanations for this phenomena are strong 
self-absorption of Ha by the many A stars in the galaxy and/or metallicity effect which 
could increase the [On] emission relative to Ha emission. We show several examples of 
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these galaxies in Figure 18, and the lack of Ha emission is clearly visible. The fact that 
many of these galaxies possess strong [Nil] lines (flanking the Ha line) indicates strong 
self-absorption is a likely explanation for the missing Ha emission line. Median [Oil] EW 
of these galaxies is 1.3 A. Compared with 11.5 Aof HDS galaxies with both [Oil] and Ha 
emission, these galaxies have much lower amount of [Oil] in emission. 

Finally, we find that 3.8±0.7% (27/717) of our HDS galaxies in our volume-limited sam- 
ple satisfy the following criteria; EW([OII]) + AEWQOII]) > OA and EW(Ha) + AEW(Ha) < OA. 
In comparison, only 52 of our HDS galaxies, the volume limited sample, have just no [On] 
emission detected (only EWQOII]) + A EW([OII]) >0A). Therefore, 52±12% (27/52) of the 
galaxies with no detected [On], have detected Ha emission. The existence of such galax- 
ies has ramifications on high redshift studies of post-starburst galaxies, as such studies use 
the [Oil] line to constrain the amount of on-going star-formation within the galaxies e.g., 
Balogh et al. (1999), Poggianti et al. (1999). Therefore, if the Ha emission comes from 
star-formation activity, then these previous high redshift studies of post starburst galaxies 
may be contaminated by such galaxies. A possible explanation for the lack of [Oil] emission 
is dust extinction e.g., Miller & Owen (2002) recently found dusty star-forming galaxies 
which do not possess [On] in emission, but have radio fluxes consistent with on-going star 
formation activity. This explanation would also be consistent with the low signal-to-noise 
we observe in the blue-end of the SDSS spectra of these galaxies, relative to the signal- 
to-noise seen in the red-end of their spectra. Median Ha EW of these galaxies are 1.5A, 
whereas that of HDS galaxies with both [Oil] and Ha emission is 25.9A. 



6. Conclusions 

We present in this paper the largest, most homogeneous, search yet for H<5-strong galax- 
ies (i.e., post-starburst galaxies, E+A's, k+a's, a+k's etc.) in the local universe. We pro- 
vide the astronomical community with a new catalog of such galaxies selected from the 
Sloan Digital Sky Survey (SDSS) based solely on the observed strength of the H<5 Hydro- 
gen Balmer absorption line within the spectrum of the galaxy. We have carefully studied 
different methodologies of measuring this weak absorption line and conclude that a non- 
parametric flux-summing technique is most suited for an automated application to large 
datasets like the SDSS, and more robust for the observed signal-to-noise ratios available 
in these SDSS spectra. We have studied the effects of dust extinction, emission-filling and 
stellar absorption upon the measurements of our US lines and have determined the external 
error on our measurements, as a function of signal-to-noise, using duplicate observations 
of 11358 galaxies in the SDSS. In total, our catalog of H5-selected (HDS) galaxies contains 
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3340 of the 95479 galaxies in the Sloan Digital Sky Survey (at the time of writing) and we 
present these HDS galaxies to the community to help in the understanding of such systems 
and to aid in the comparison with higher redshift studies of post-starburst galaxies. 

The measured abundance of our H5-selected (HDS) galaxies is 2.6±0.1% of all galaxies 
within a volume-limited sample of 0.05 < z < 0.1 and M(r*)< —20.5, which is consistent 
with previous studies of post-starburst galaxies in the literature. We find that only 25 
galaxies (3.5±0.7%) of HDS galaxies in this volume limited sample show no, or little, evidence 
for [On] and Ha emission lines, thus indicating that true E+A galaxies (as originally defined 
by Dressier & Gunn) are extremely rare objects, i.e., only 0.09 ±0.02% of all galaxies in our 
volume-limited sample. In contrast, 89 ±5% of our HDS galaxies have significant detections 
of the [Oil] and Ha emission lines. Of these, 131 galaxies are robustly classified as Active 
Galactic Nuclei (AGNs) and therefore, a majority of these emission line HDS galaxies are 
star-forming galaxies, similar to the e(a) and A+em galaxies discussed by Poggianti et al. 
(1999) and Balogh et al. (1999). We will study the global properties of our HDS galaxies in 
further detail in future papers. 
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Fig. 1. — The distribution of signal-to-noise for spectra used in this analysis (see Section 
2). The signal-to-noise presented here is the average signal-to-noise per pixel over the 
wavelength range defined by the SDSS photometric g passband. The median signal-to-noise 
value is 8.3. Galaxies with signal-to-noise less than 5 were not used in our study. 
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Fig. 2. — The H<5 EW (A) as measured in two different wavelength windows, i.e., the wide 
window of Abrahams et al. (1996) and the narrow window of Balogh et al. (1999). The 
expected one-to-one line is plotted to help guide the eye. For the work presented in this 
paper, absorption lines have a positive EW values and emission lines have negative EW 
values. 



-5 4 5 10 

H<5 ma „ EW (A) 

Fig. 3. — A comparison of the R5 EW as measured automatically by the SDSS SPECTROID 
spectroscopic pipeline (a Gaussian fit to the B.5 line) and the non-parametric summation 
technique discussed in this paper and presented in Figure 2. The one-to-one line is shown 
to guide the eye. In our work, absorption lines have positive EW values and emission lines 
have negative EW values. 
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Fig. 4. — Five typical examples of SDSS spectra with H5 emission filling. In such cases it is 
difficult to fit the H<5 absorption emission with a single Gaussian due to a centrally peaked 
emission. The double hashed region of this figure, centered on the B.5 line, represents the 
narrow wavelength window used to measure the EW of H5 as explained in Section 3.1. The 
slightly wider hashed region, again centered on the B.S line, represented the wide wavelength 
window used to measure the H<5 line (see Section 3.1). Finally, the dashed regions, one on 
each side of the hashed regions, represents the wavelength regions used to estimate the 
continuum flux. See also Table 1 for details of the wavelength windows used on measuring 
the H<5 line. 
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Fig. 5. — Five examples of noisy spectra where the SDSS SPECTR01D pipeline has fit a 
broad absorption line, thus overestimating the H<5 EW. The hashed regions are the same as 
presented and discussed in Figure 4. 
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Fig. 6. — The amount of emission filling correction of H5 EW. A solid line is for the iteration 
method (EF1) and the shaded histogram uses the D4000 method (EF2). 
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Fig. 7. — In the left panel, we present the comparison of our [On] EW measurements (flux 
summing) and those of SPECTROID (Gaussian fitting) for all SDSS spectra regardless of 
their B.S EWs. In the right panel, we plot the percentage difference between these two 
measurements. Positive percentages mean our flux summing method has a larger value. 
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Fig. 8. — In the left panel, we present the comparison of our Ha EW measurements (flux 
summing) and those of SPECTR01D (Gaussian fitting) for all SDSS spectra regardless of 
their B.S EWs. In the right panel, we plot the percentage difference between these two 
measurements. Positive percentages mean our flux summing method has a larger value. 
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Fig. 9. — The absolute difference in the measured B.S EW (A) for duplicate observations of 
SDSS galaxies as a function of signal-to-noise (the lower of the two signal-to-noise ratios 
has been plotted here). The solid line shows the la polynomial line fitted to the distribution 
of errors (as a function of signal-to-noise). 
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Fig. 10. — The absolute difference in the [On] EW (A) for duplicate observations of SDSS 
galaxies as a function of signal-to-noise (the lower of the two signal-to-noise ratios has been 
plotted here). The solid line shows the la polynomial line fitted to the distribution of errors 
(as a function of signal-to-noise). 
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Fig. 11. — The absolute difference in the Ha EW (A) for duplicate observations of SDSS 
galaxies as a function of signal-to-noise (the lower of the two signal-to-noise ratios has been 
plotted here). The solid line shows the la polynomial line fitted to the distribution of errors 
(as a function of signal-to-noise). 
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Fig. 12. — We present the distribution of differences for the H<5 line from our duplicate 
observations. The four panels denote four different bins in signal-to-noise, i.e., clockwise 
from the top-left panel, we have s/n < 7, 7 < s/n < 10, 15 < s/n < 20 and 10 < s/n < 15. 
We show as a dotted line the best fit gaussian to these distributions, which was then used to 
determine the la error (shown for each panel) on H<5 EW as a function of signal-to-noise. 
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Fig. 13. — We present the distribution of differences for the [On] line from our duplicate 
observations. The four panels denote four different bins in signal-to-noise, i.e., clockwise 
from the top-left panel, we have 5 < s/n < 10, 10 < s/n < 15, 20 < s/n < 25 and 
15 < s/n < 20. We show as a dotted line the best fit gaussian to these distributions, which 
was then used to determine the la error on [On] EW as a function of signal-to-noise. 
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Fig. 14. — We present the distribution of differences for the Ha line from our duplicate 
observations. The four panels denote four different bins in signal-to-noise, i.e., clockwise 
from the top-left panel, we have 5 < s/n < 10, 10 < s/n < 15, 20 < s/n < 25 and 
15 < s/n < 20. We show as a dotted line the best fit gaussian to these distributions, which 
was then used to determine the lex error on Ha as a function of signal-to-noise. 
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Fig. 15. — The fraction of HDS galaxies as a function signal-to-noise in the g band. The 
error bars are v^iV, where N is the number of galaxies in each bin. 
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Fig. 16. — We show here five examples of spectra for the "true E+A" (k+a) subsample of 
HDS galaxies discussed in Section 5. These galaxies possess strong Balmer absorption lines, 
but have no, or little, detected [On] or Ha emission. We present a range of signal-to-noise 
ratios, as well as provide the measured redshift and H<5 EW for each galaxy. 
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Fig. 17. — We present here five example spectra of our HDS galaxies that possess detected 
[On] and Ha emission lines. We present a range of signal-to-noise ratios, as well as provide 
the measured redshift and H5 EW for each galaxy. 
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Fig. 18. — We present here five example spectra of our HDS galaxies that possess detected 
[On] emission lines, but no detected Ha. We present a range of signal-to-noise ratios, as 
well as provide the measured redshift, H<5 EW and name for each galaxy. 
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Table 1: The wavelength ranges used to measure our B.5, [On] and Ha EWs. 





Blue continuum 


Line 


Red continuum 


H5 (narrow) 


4030-4082A 


4088-4016A 


4122-4170A 


R5 (wide) 


4030-4082A 


4082-4022A 


4122-4170A 


[On] 


3653-3713A 


3713-3741A 


3741-3801A 


Ha 


6490-6537A 


6555-6575A 


6594-6640A 



Table 2: Coefficients of third order polynomial fits to the error distributions shown in Figures 
9, 10 and 11. 



Line 


a 


ax 


a 2 


a 3 


m 


2.98 


-0.28 


0.012 


-0.00018 


[On] 


4.96 


-0.39 


0.014 


-0.00016 


Ha 


3.74 


-0.36 


0.014 


-0.00017 



Table 3: The frequency of finding HPS galaxies. 



Category 


% (All galaxies) 


% (Volume Limited) 


Whole HDS sample 
True "E+A" 


3340/95479 (3.50±0.06%) 
140/94770 (0.15±0.01%) 


717/27014 (2.6±0.1%) 
25/26863 (0.09±0.02%) 
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Table 4: A comparison of our HDS sample of galaxies to previous work in the literature. 



Author 


Balmer lines 


Emission 


Their % (field) 


Our % 


Zabludoff et al. 
Poggianti et al. 
Balogh et al. 


E5 >5.5A 
m >3 A 
H5 >5 A 


[OII]>-2.5A 
[OII]>-5 A 
[OII]>-5 A 


0.19±0.04% 
6±3% 
1.2 ± 0.8% 


0.16±0.02% (80/49994) 
5.79±0.15% (1565/27014) 
0.74±0.05% (200/27014) 



